CDS
Accession Number | TCMCG075C05185 |
gbkey | CDS |
Protein Id | XP_007041847.2 |
Location | join(3006636..3006857,3006961..3007169,3007270..3007336,3007416..3007508,3007610..3007753,3007927..3008021,3008121..3008226,3008321..3008405,3008496..3008614,3008708..3008875,3008966..3009141,3009274..3009383,3009499..3009608,3009696..3009803,3009892..3009993,3010117..3010262,3010371..3010716,3010804..3010929) |
Gene | LOC18607554 |
GeneID | 18607554 |
Organism | Theobroma cacao |
Protein
Length | 843aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007041785.2 |
Definition | PREDICTED: beta-galactosidase 13 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | beta-galactosidase |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs |
GO:0003674
[VIEW IN EMBL-EBI] GO:0003824 [VIEW IN EMBL-EBI] GO:0004553 [VIEW IN EMBL-EBI] GO:0004565 [VIEW IN EMBL-EBI] GO:0005575 [VIEW IN EMBL-EBI] GO:0005618 [VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005737 [VIEW IN EMBL-EBI] GO:0005773 [VIEW IN EMBL-EBI] GO:0015925 [VIEW IN EMBL-EBI] GO:0016787 [VIEW IN EMBL-EBI] GO:0016798 [VIEW IN EMBL-EBI] GO:0030312 [VIEW IN EMBL-EBI] GO:0043226 [VIEW IN EMBL-EBI] GO:0043227 [VIEW IN EMBL-EBI] GO:0043229 [VIEW IN EMBL-EBI] GO:0043231 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044444 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0071944 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGGCGGTGTCAGGCCGTATACTTTTAAGAATAACCCTTTTCACCTTGCTGGTTGCTTCCAGCATTGCGCATGACAAGAAGGATCATGACGATGGGGATGACCATAAGGCTGAGAACAAGGTCAACCAGGGCGTGACCTATGATGGAAGGTCTGTGATCATCAATGGCAAAAGAGAGCTGCTTTTCTCGGGCTCCATTCATTACCCTCGCAGCACCCCAGATACCTGGCCCGACCTCCTCACAAAAGCTAAATATGGAGGTCTGAACGTGATCCAGACGTATGTTTTCTGGAACATTCATGAGCCGATTGAGGGTCAGTACAATTTTGAAGGGCAATATGACTTGGTGAAGTTCATCAAGTTGATTGGGGAGCATAAAATGTATGCGACCCTCCGGGTTGGCCCGTTTATTCAGGCTGAATGGAACCATGGAGGATTACCATATTGGCTAAGAGAGGTCCGCAACATCACATTCCGCTCTGACAATGAACCATTCAAGCATTACATGAAAAAATTCGTTACGATGATTATTGATATGATGAAGAAGGAGAAGTTGTTTGCTTCACAAGGAGGCCCTATCGTTTTATCACAGATCGAGAATGAGTACAACACCATTCAACTAGCATTCAGAGAACTTGGAGACAGTTATGTTCAGTGGGCAGGAAAGATGGCCGTTGGCTTAAACACCGAAGTCCCATGGATCATGTGCAAGCAGAGGGATGCCCCAGATCCAATTATTAATACATGCAATGGAAGACACTGCGGAGATACTTTCACAGGTCCAAATAGGCGTAACAAGCCTTCATTGTGGACTGAGAACTGGACTGCACAGTATAGAGTATTCGGAGATCCACCATCTCAAAGGTCAGCTGAAGATTTGGCGTACTCGGTGGCTCGCTTCTTCTCCAAAAATGGATCTCTGGTCAACTACTACATGTACCATGGTGGCACAAATTATGGCAGAACAAGCGCTGCTTTTACAACAACTCGCTACTATGACGAAGCCCCTCTTGATGAATATGGTTTACAAAGGGACCCAAAATGGGGCCACCTCAAGGATCTTCACAAGGCCCTAAATTTGTGCAAAAAGGCTCTGCTTTGGGGATCTCCTACCGTCCAAAAGCTGGGTCCAGACCAAGAGGTCCGAACCTACAAGCAACCTGGAACTTCTCTCTGTGCAGCTTTCTTGGCCAACAATGACACCCAGAACGCGCAAACATTCCATTTCAGGGGTAAGCAATATCGCCTACCAGCTCGCTCCATCAGTATCCTCCCTGACTGCAAGACCGTGGTTTACAACACTCAGATGATTACGGCACAACATAACACGAGAAATTTTGTAAGATCAGCAACTGCAAACAAGAACTTTAACTGGCAGATGTACAAGGAATATGTTCCAACCCAACTTGGATCTATGACCAAGGAACCAATGGAGCTTTATGAGTTGACCAAAGATACAACGGATTATGCTTGGTATACAACTAGCATTGAATTGGGTCCACGTGACTTGCCAATGAAAAAAGAAATCTTCCCAGTTTTACGGGTTGCAAGTCTTGGCCATGGACTCCTTGCTTTTGTAAATGGCGAATATATAGGTTTTGCACATGGGAGCAAAGTTGAGAAGAGCTTTGTCTTCCAGAAACCTGTAAAGTTGAAGGCAGGGGTTAACCAAATTACACTCTTGGGGACTTTAGTGGGACTTCCAGATAGCGGAGCCTACATGGAGCATAGGTTTGCTGGGCCTCGTTCTATCACCATATTAGGTTTGAACACCGGAACACTTGACCTGTCAGTAAATGGCTGGGGACATCAGGTTGGACTGAATGGAGAAAAGAAGAAAATATATACCGAGAAAGGCTCAACGAAGGTAGAGTGGAGGAAGCTTAGTGAATCACCAGCTTTAACATGGTACAAGGGATACTTTGACACCCCAGAAGGAAACAACCCAGTTGCCATCCGGATGACTGGTATGGGGAAAGGTATGGTCTGGATCAATGGTCAGAACATTGGCCGATATTGGATGTCTTACCTTTCTCCTCTAAAGCAGCCTTCTCAATCCGAGTACCAAATCCCAAGATCGTTCCTCAAGCCGACGCAGAATCTCATTGTTATATTGGAGGAGCAGGAAGGCAATCCGAAAGATGTTGAGATCCTGCTAGTTAATAGAGATACAATCTGCAGTTACGTAACCGAATATCATCCGCCATCGGTGAGGTTATTCGAAAGCAAAGGTGGCAGCTTGCGAGCCAAGGTGGATGATTTGAAACCAAAGGCTGAACTGACTTGTCCGAACCAGAAAAAGATCGTTACCGTAGAGTTTGCGAGCTTTGGTGATCCTTTTGGTGCCTGCGGAAGCTACTCCCTTGGAAATTGCACGTTCCCCGTATCCAAGAAAGTTGCGGAAAAGTTTTGTCTGGGAAAAACCAGCTGCCAAATTCCATTGGACGCTGAGGATTTCGACAAGCAAAACGATGCCTGCCCACATATGAAAAAGGCTCTTGCCGTCCAAGTCAAGTGTGCTTACAAGAAGTAA |
Protein: MAVSGRILLRITLFTLLVASSIAHDKKDHDDGDDHKAENKVNQGVTYDGRSVIINGKRELLFSGSIHYPRSTPDTWPDLLTKAKYGGLNVIQTYVFWNIHEPIEGQYNFEGQYDLVKFIKLIGEHKMYATLRVGPFIQAEWNHGGLPYWLREVRNITFRSDNEPFKHYMKKFVTMIIDMMKKEKLFASQGGPIVLSQIENEYNTIQLAFRELGDSYVQWAGKMAVGLNTEVPWIMCKQRDAPDPIINTCNGRHCGDTFTGPNRRNKPSLWTENWTAQYRVFGDPPSQRSAEDLAYSVARFFSKNGSLVNYYMYHGGTNYGRTSAAFTTTRYYDEAPLDEYGLQRDPKWGHLKDLHKALNLCKKALLWGSPTVQKLGPDQEVRTYKQPGTSLCAAFLANNDTQNAQTFHFRGKQYRLPARSISILPDCKTVVYNTQMITAQHNTRNFVRSATANKNFNWQMYKEYVPTQLGSMTKEPMELYELTKDTTDYAWYTTSIELGPRDLPMKKEIFPVLRVASLGHGLLAFVNGEYIGFAHGSKVEKSFVFQKPVKLKAGVNQITLLGTLVGLPDSGAYMEHRFAGPRSITILGLNTGTLDLSVNGWGHQVGLNGEKKKIYTEKGSTKVEWRKLSESPALTWYKGYFDTPEGNNPVAIRMTGMGKGMVWINGQNIGRYWMSYLSPLKQPSQSEYQIPRSFLKPTQNLIVILEEQEGNPKDVEILLVNRDTICSYVTEYHPPSVRLFESKGGSLRAKVDDLKPKAELTCPNQKKIVTVEFASFGDPFGACGSYSLGNCTFPVSKKVAEKFCLGKTSCQIPLDAEDFDKQNDACPHMKKALAVQVKCAYKK |